Estimating the quality of phonetic transcriptions and segmentations of speech signals

Authors

  • Maria-Barbara Wesenick
  • Andreas Kipp
Abstract

We present our approach to the problem of evaluating segmentations and transcriptions of speech data. We developed an automatic pattern-matching procedure that relates different manual or automatic segmentations to each other. Segmentations are compared by the degree to which their chosen labels are identical and their segment boundaries coincide. Because we exemplify our evaluation method on automatic transcriptions produced by the Munich AUtomatic Segmentation System (MAUS), which is currently being developed at the IPSK (Kipp et al. [4]), our data also give information on the quality of the system's segmentation and transcription performance.
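The abstract does not say how the pattern matching is carried out, so the following is only a rough sketch of the two measures it names (label identity and boundary identity), not the authors' procedure. It assumes each segment is a hypothetical (label, start, end) tuple with times in seconds, uses Python's standard `difflib.SequenceMatcher` as a stand-in aligner, and picks an arbitrary 20 ms boundary tolerance.

```python
from difflib import SequenceMatcher

# Assumption: a segment is a (label, start_s, end_s) tuple; this layout is
# illustrative and not taken from the paper.
def compare_segmentations(ref, hyp, tolerance=0.020):
    """Return (label_agreement, boundary_agreement) for two segmentations.

    label_agreement: aligned identical labels / length of the longer sequence.
    boundary_agreement: share of aligned segments whose start times differ
    by at most `tolerance` seconds (the tolerance value is an assumption).
    """
    ref_labels = [seg[0] for seg in ref]
    hyp_labels = [seg[0] for seg in hyp]
    # Stand-in alignment: difflib finds matching runs of identical labels.
    matcher = SequenceMatcher(None, ref_labels, hyp_labels, autojunk=False)
    matched = 0
    close = 0
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            ref_start = ref[block.a + k][1]
            hyp_start = hyp[block.b + k][1]
            matched += 1
            if abs(ref_start - hyp_start) <= tolerance:
                close += 1
    label_agreement = matched / max(len(ref), len(hyp), 1)
    boundary_agreement = close / matched if matched else 0.0
    return label_agreement, boundary_agreement


# Made-up example data: a manual and an automatic segmentation of one word.
manual = [("h", 0.00, 0.05), ("a", 0.05, 0.15), ("l", 0.15, 0.22), ("o", 0.22, 0.40)]
automatic = [("h", 0.00, 0.06), ("a", 0.06, 0.15), ("o", 0.26, 0.40)]
label_score, boundary_score = compare_segmentations(manual, automatic)
```

In this made-up example three of the four manual labels find a counterpart, and two of those three aligned segments start within the tolerance of each other. A real evaluation would also need to decide how to treat substituted labels and end boundaries, which this sketch ignores.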

Similar resources

A Comparison of Different Approaches to Automatic Speech Segmentation

We compare different methods for obtaining accurate speech segmentations starting from the corresponding orthography. The complete segmentation process can be decomposed into two basic steps. First, a phonetic transcription is automatically produced with the help of large vocabulary continuous speech recognition (LVCSR). Then, the phonetic information and the speech signal serve as input to a s...

Automatic phonetic transcription of large speech corpora: a comparative study

This study investigates whether automatic transcription procedures can approximate manual phonetic transcriptions typically delivered with contemporary large speech corpora. We used ten automatic procedures to generate a broad phonetic transcription of well-prepared speech (read-aloud texts) and spontaneous speech (telephone dialogues). The resulting transcriptions were compared to manually ver...

Validation of phonetic transcriptions based on recognition performance

In fundamental linguistic as well as in speech technology research there is an increasing need for procedures to automatically generate and validate phonetic transcriptions. Whereas much research has already focussed on the automatic generation of phonetic transcriptions, far less attention has been paid to the validation of such transcriptions. In the little research performed in this a...

How to Improve Human and Machine Transcriptions of Spontaneous Speech

This paper reports on an experiment aimed at measuring the quality of automatic and human phonetic transcriptions of different speech styles that were produced within the framework of a large speech corpus project for Dutch, the Spoken Dutch Corpus (Corpus Gesproken Nederlands, CGN). The results indicate that the procedure adopted in the CGN to improve the quality of phonetic transcriptions...

A Comparison of Spectral Methods for Spoken Language Identification

Automatic spoken language identification is the task of identifying a language from the speech signal. Language identification systems can be divided into two categories: spectral-based methods and phonetic-based methods. In the former, short-time characteristics of the speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

Publication date: 1996